Biblioteca Digital

498 resultados para Duplicate tuples

Optimization of algorithm to identification of duplicate tuples through similarity phonetic based on multithreading

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Aiming to ensure greater reliability and consistency of data stored in the database, the data cleaning stage is set early in the process of Knowledge Discovery in Databases (KDD) and is responsible for eliminating problems and adjust the data for the later stages, especially for the stage of data mining. Such problems occur in the instance level and schema, namely, missing values, null values, duplicate tuples, values outside the domain, among others. Several algorithms were developed to perform the cleaning step in databases, some of them were developed specifically to work with the phonetics of words, since a word can be written in different ways. Within this perspective, this work presents as original contribution an optimization of algorithm for the detection of duplicate tuples in databases through phonetic based on multithreading without the need for trained data, as well as an independent environment of language to be supported for this. © 2011 IEEE.

Enriquecimento de dados: uma pré-etapa em relação à limpeza de dados

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Pós-graduação em Ciência da Computação - IBILCE

Ambiente independente de idioma para suporte a identificação de tuplas duplicadas por meio da similaridade fonética e numérica: otimização de algoritmo baseado em multithreading

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Pós-graduação em Ciência da Computação - IBILCE

Tool for Identification of Duplicate Records Downloaded from Multiple Cd-Roms. A Case Study with Spirs Based Databases

Relevância:

20.00% 20.00%

Publicador:

Resumo:

As research becomes more and more interdisciplinary, literature search from CD-ROM databases is often carried out on more than one CD-ROM database. This results in retrieving duplicate records due to same literature being covered (indexed) in more than one database. The retrieval software does not identify such duplicate records. Three different programs have been written to accomplish the task of identifying the duplicate records. These programs are executed from a shell script to minimize manual intervention. The various fields that have been used (extracted) to identify the duplicate records include the article title, year, volume number, issue number and pagination. The shell script when executed prompts for input file that may contain duplicate records. The programs identify the duplicate records and write them to a new file.

Non Identical Duplicate Video Detection using SIFT Method

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Non-Identical Duplicate video detection is a challenging research problem. Non-Identical Duplicate video are a pair of videos that are not exactly identical but are almost similar.In this paper, we evaluate two methods - Keyframe -based and Tomography-based methods to determine the Non-Identical Duplicate videos. These two methods make use of the existing scale based shift invariant (SIFT) method to find the match between the key frames in first method, and the cross-sections through the temporal axis of the videos in second method.We provide extensive experimental results and the analysis of accuracy and efficiency of the above two methods on a data set of Non- Identical Duplicate video-pair.

The defect sequence for contractive tuples

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We introduce the defect sequence for a contractive tuple of Hilbert space operators and investigate its properties. The defect sequence is a sequence of numbers, called defect dimensions associated with a contractive tuple. We show that there are upper bounds for the defect dimensions. The tuples for which these upper bounds are obtained, are called maximal contractive tuples. The upper bounds are different in the non-commutative and in the commutative case. We show that the creation operators on the full Fock space and the coordinate multipliers on the Drury-Arveson space are maximal. We also study pure tuples and see how the defect dimensions play a role in their irreducibility. (C) 2012 Elsevier Inc. All rights reserved.

Maximal Contractive Tuples

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Maximality of a contractive tuple of operators is considered. A characterization for a contractive tuple to be maximal is obtained. The notion of maximality for a submodule of the Drury-Arveson module on the -dimensional unit ball is defined. For , it is shown that every submodule of the Hardy module over the unit disc is maximal. But for we prove that any homogeneous submodule or submodule generated by polynomials is not maximal. A characterization of maximal submodules is obtained.

Rapid evolution in a pair of recent duplicate segments of rice

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Gene duplication has been considered the most important way of generating genetic novelties. The subsequent evolution right after gene duplication is critical for new function to occur. Here we analyzed the evolutionary pattern for a recently duplicated s

Characteristic Functions for Ergodic Tuples

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Gohm, Rolf; Dey, S., 'Characteristic function for ergodic tuples', Integral Equations and Operator Theory 58(1) pp.43-63 RAE2008

Hypercyclic tuples of operators on $C^n$ and $R^n$

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A tuple $(T_1,\dots,T_n)$ of continuous linear operators on a topological vector space $X$ is called hypercyclic if there is $x\in X$ such that the the orbit of $x$ under the action of the semigroup generated by $T_1,\dots,T_n$ is dense in $X$. This concept was introduced by N.~Feldman, who have raised 7 questions on hypercyclic tuples. We answer those 4 of them, which can be dealt with on the level of operators on finite dimensional spaces. In
particular, we prove that the minimal cardinality of a hypercyclic tuple of operators on $\C^n$ (respectively, on $\R^n$) is $n+1$ (respectively, $\frac n2+\frac{5+(-1)^n}{4}$), that there are non-diagonalizable tuples of operators on $\R^2$ which possess an orbit being neither dense nor nowhere dense and construct a hypercyclic 6-tuple of operators on $\C^3$ such that every operator commuting with each member of the tuple is non-cyclic.

The distribution of k-tuples of reduced residues

Relevância:

20.00% 20.00%

Publicador:

Resumo:

En 1940, Paul Erdős énonça une conjecture sur la distribution des classes inversibles modulo un entier. La présente thèse étudie la distribution des k-uplets de classes inversibles propose une preuve de la conjecture d'Erdős étendue au cas des k-uplets.

Thomas G. Mower [New York, NY] to W. Beaumont [Plattsburgh, NY] regarding: duplicate receipts for monies transferred to Mower. November 16, 1833

Relevância:

20.00% 20.00%

Publicador:

Benjamin King, Acting Surgeon General [Washington, DC] to W. Beaumont [Saint Louis, MO] regarding: return of duplicate vouchers. October 30, 1837. Beaumont’s note regarding receipt of account from the Surgeon General. October 25, 1837

Relevância:

20.00% 20.00%

Publicador:

Memorandum from W. Beaumont [Saint Louis, MO] to George Johnson, Jr. regarding: settlement of accounts with Johnson, rent to be paid for use of Beaumont’s office. November 1, 1849. Memorandum from George Johnson Jr. regarding: list of names and amounts paid in cash from January 2, 1844 through November 27, 1847 with interest he assigned. March 27, 1849. Memorandum from George Johnson, Jr. regarding: duplicate receipt of $1500 paid to Johnson by Beaumont and list of Johnson’s uncollected accounts to January 1, 1848. March 27, 1849. Memorandum from George Johnson, Jr. regarding: memo of old unsettled balances on books through January 1, 1850 and list of names with money owed Beaumont. January 1, 1850

Relevância:

20.00% 20.00%

Publicador:

Comparison of dietary fat and fatty acid intake estimated by the duplicate diet collection technique and estimated dietary records

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Introduction A high saturated fatty acid intake is a well recognized risk factor for coronary heart disease development. More recently a high intake of n-6 polyunsaturated fatty acids (PUFA) in combination with a low intake of the long chain n-3 PUFA, eicosapentaenoic acid and docosahexaenoic acid has also been implicated as an important risk factor. Aim To compare total dietary fat and fatty acid intake measured by chemical analysis of duplicate diets with nutritional database analysis of estimated dietary records, collected over the same 3-day study period. Methods Total fat was analysed using soxhlet extraction and subsequently the individual fatty acid content of the diet was determined by gas chromatography. Estimated dietary records were analysed using a nutrient database which was supplemented with a selection of dishes commonly consumed by study participants. Results Bland & Altman statistical analysis demonstrated a lack of agreement between the two dietary assessment techniques for determining dietary fat and fatty acid intake. Conclusion The lack of agreement observed between dietary evaluation techniques may be attributed to inadequacies in either or both assessment techniques. This study highlights the difficulties that may be encountered when attempting to accurately evaluate dietary fat intake among the population.

«
1
2
3
4
5
6
7
8
...
33
34
»